Quantized HMMs for low footprint text-to-speech synthesis

نویسندگان

  • Alexander Gutkin
  • Xavi Gonzalvo
  • Stefan Breuer
  • Paul Taylor
چکیده

This paper proposes the use of Quantized Hidden Markov Models (QHMMs) for reducing the footprint of conventional parametric HMM-based TTS system. Previously, this technique was successfully applied to automatic speech recognition in embedded devices without loss of recognition performance. In this paper we investigate the construction of different quantized HMM configurations that serve as input to the standard ML-based parameter generation algorithm. We use both subjective and objective tests to compare the resulting systems. Subjective results for specific compression configurations show no significant preference although some spectral distortion is reported. We conclude that a trade-off is necessary in order to satisfy both speech quality and low-footprint memory requirements.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Low memory acoustic models for HMM based speech recognition

In this paper, we propose a new approach to reduce the memory footprint of HMM based ASR systems. The proposed method involves three steps. Starting from the continuous density HMMs, mixture variances are tied using k-means based vector quantization. Next, the reestimation of the resulted models is performed with tied variances. Finally, scalar quantization is performed for the mean components ...

متن کامل

Distributed Text to Speech Synthesis for Embedded Systems – An analysis

his work we describe the design of a Text to Speech S) synthesizer for embedded devices using Flite, a ll footprint text to speech system. We outline two hods for implementation of a TTS on a low resource ice. The first method focuses on porting the entire hesizer onto an embedded device directly. The second hod explores possibilities of using distributed speech hesis when the available memory ...

متن کامل

Speech recognition using HMMs with quantized parameters

In this paper we describe the structure and examine the performance of a recognition engine based on hidden Markov models (HMMs) with quantized parameters (qHMM). The main goal of qHMMs is to enable a low complexity implementation without sacrificing the classification performance. In the tests with a whole word digit dialler engine and a phoneme based isolated word recognizer we managed to pre...

متن کامل

Comparison of low footprint acoustic modeling techniques for embedded ASR systems

In this paper we compare the performance of speech recognition systems based on hidden Markov models (HMM) with quantized parameters (qHMMs) and subspace distribution clustering hidden Markov models (SDCHMMs). Both of these HMM types provide similar performance as continuous density HMMs, but with significantly reduced memory requirements (approximately 90% less memory was needed to store the H...

متن کامل

Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling

In this paper we present a method for speech modeling and its utilization in IBM’s small footprint concatenative text-tospeech system. The method is based on frequency-domain, complex spectral envelope modeling, where the phase component plays a crucial role in attaining high quality speech synthesis. The modeling scheme presented enables low bit rate compression of the amplitude and phase info...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010